Next Generation Cancer Data Discovery, Access, and Integration Using Prizms and Nanopublications

نویسندگان

  • James P. McCusker
  • Timothy Lebo
  • Michael Krauthammer
  • Deborah L. McGuinness
چکیده

To encourage data sharing in the life sciences, supporting tools need to minimize effort and maximize incentives. We have created infrastructure that makes it easy to create portals that supports dataset sharing and simplified publishing of the datasets as high quality linked data. We report here on our infrastructure and its use in the creation of a melanoma dataset portal. This portal is based on the Comprehensive Knowledge Archive Network (CKAN) and Prizms, an infrastructure to acquire, integrate, and publish data using Linked Data principles. In addition, we introduce an extension to CKAN that makes it easy for others to cite datasets from within both publications and subsequently-derived datasets using the emerging nanopublication and World Wide Web Consortium provenance standards.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Genome Annotation using Nanopublications: An Approach to Interoperability of Genetic Data

With the widespread use of Next Generation Sequencing (NGS) technologies, the primary bottleneck of genetic research has shifted from data production to data analysis. However, annotated datasets produced by different research groups are often in different formats, making genomic comparisons and integration with other datasets challenging and time consuming tasks. Here, we propose a new data in...

متن کامل

Finding Novel Associations Across Domains Using Linked Data: a Case Study on Genetic Variants Disrupting Transcription Start Sites

With the widespread use of Next Generation Sequencing technologies, the primary bottleneck of genetic research has shifted from data production to data analysis. However, heterogeneous data sets makes comparisons and integration challenging and time consuming. Here, we apply a data interoperability approach that provides unambiguous (machine readable) description of genomic annotations based on...

متن کامل

Using Nanopublications to Incentivize the Semantic Exposure of Life Science Information

The growing rate of data production in the life sciences creates an urgent need for semantic integration of information. Although the development of tools and infrastructure will make semantic data exposure easier with time, presently the effort associated with creating linked data remains largely unrecognized by peer-review processes, publishers, and promotion committees. Here, we describe a n...

متن کامل

Next Generation Sequencing and its Application in the Study of Microbiome in Plant Diseases Suppressive Soils

Progress in next-generation sequencing has played a significant role in ecological studies of microbial populations. These advances have led to a rapid evaluation in metagenomics studies (analysis of DNA of microbial communities without the need to culture). Many statistical and computational tools and metagenomics databases have led to the discovery of huge amounts of data. In this research, i...

متن کامل

Publishing DisGeNET as nanopublications

The increasing and unprecedented publication rate in the biomedical field is a major bottleneck for knowledge discovery in the Life Sciences. The manual curation of facts from published scientific papers is slow and inefficient, and therefore new approaches are needed that can enable the automatic, scalable and reliable extraction of assertions. While the publication of scientific assertions an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Data integration in the life sciences : ... International Workshop, DILS ... : proceedings. DILS

دوره 7970  شماره 

صفحات  -

تاریخ انتشار 2013